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METHOD AND SYSTEM FOR UNIFORM RESOURCE 
LOCATOR TRANSFORMATION 

TECHNICAL FIELD OF THE INVENTION 

This invention relates generally to the field of 
data processing and, more particularly, to a method and 
system for uniform resource locator transformation. 
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BACKGROUND OF THE INVENTION 

As computers have grown increasingly important in 
today's society, the use of the Internet has also 
increased. The increased usage of the Internet has lead 
5 to an decrease in response time as busy web sites attempt 
to handle a large volume of traffic. 

One solution to handling large amounts of traffic at 
popular web sites is to cache frequently requested 
content from the web site at the user's computer and/or 

10 at other sites. By spreading the content requests around 
an increased number of locations, performance is improved 
by decreasing the load at a single web site. 

One of the limitations to many existing caching 
solutions is the inability to cache content that varies 

15 based on information in the request header as opposed to 
the uniform resource locator (URL) indicating the content 
item. For example, a popular site such as Yahoo may 
provide localized versions of web pages based on the 
language information in the header of the hypertext 

2 0 transport protocol (HTTP) request while using the same 
URL. 
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SUMMARY OF THE INVENTION 

The present invention addresses the problems and 
disadvantages associated with prior systems. In 
particular, the present invention provides a method and 
5 system for uniform resource locator transformation. 

One aspect of the invention is a method for 
communicating data comprising receiving a first request 
at a cache server. The first request has an associated 
first content item and has an associated header portion. 
10 The method further includes comparing the first content 
item and the header portion to predefined criteria. In 
addition, the method includes generating a second request 
based on the criteria, the header portion and the first 
content item, the second request being associated with a 
15 second content item, and retrieving the second content 
item based on the second request. 

Another aspect of the invention is a system for 
communicating data comprising a computer readable memory 
and an application stored in the computer readable 

2 0 memory. The application is operable to receive a first 

request at a cache server. The first request has an 
associated first content item and has an associated 
header portion. The application is further operable to 
compare the first content item and the header portion to 
25 predefined criteria. In addition, the application is 
operable to generate a second request based on the 
criteria, the header portion and the first content item, 
the second request being associated with a second content 
item, and retrieve the second content item based on the 

3 0 second request. 

The present invention provides important technical 
advantages. Various embodiments of the invention may 
have none, some, or all of these advantages. One such 
technical advantage is the capability for transforming 
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the uniform resource locators (URLs) portion of a 
hypertext transport protocol (HTTP) request using 
information in the header portion of the HTTP request. 
Including header information in the URL portion allows 
for the caching of header dependent content items. Thus, 
a site which returns different content based on, for 
example, language information in the header portion, 
using the same URL of, for example, 

"http://www.domain.com/" may now be cached. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

For a more complete understanding of the present 
invention, the objects and advantages thereof, reference 
is now made to the following descriptions taken in 
5 connection with the accompanying drawings in which: 

FIGURE 1 is a block diagram illustrating a uniform 
resource locator transformation system; 

FIGURE 2 is a chart illustrating an exemplary 
embodiment of transform criteria used with system of 
10 FIGURE 1 for transforming uniform resource locators; and 

FIGURE 3 is a flow chart illustrating a method of 
transforming URLs using the system of FIGURE 1. 
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DETAILED DESCRIPTION OF THE INVENTION 

A better understanding of the present invention will 
be realized from the detailed description that follows, 
taken in conjunction with the accompanying drawings. 
5 FIGURE 1 is a block diagram illustrating a uniform 

resource locator (URL) transformation system 10. System 
10 comprises a client 12, a network 14, a cache server 16 
and an origin server 18. System 10 is operable to 
provide the capability to transform URLs used with the 

10 hypertext transport protocol (HTTP) and the hypertext 
markup language (HTML) such that information used by a 
World Wide Web (web) server in the HTTP header is 
included in the uniform resource identifier (URI) portion 
of an HTTP request. Typically, caching systems use the 

15 URI portion of an HTTP request to identify cached content 
items. However, caching systems are reduced in 

effectiveness when origin servers 18 provide differing 
content based on information in the HTTP header portion 
of the HTTP request because the cache system caches on 

2 0 the URI, not the header information. System 10 provides 

the capability to transform the URI portion of the HTTP 
request to include information from the header portion of 
the HTTP request so that caching systems can cache 
header -dependent content items. 
25 Client 12 comprises a general purpose or specialized 

computing system operable to receive information from a 
user (not shown) , such as a Microsoft Windows based 
computer, a UNIX or Linux based computer or a Macintosh 
computer. Client 12 may also comprise a web browser 2 0 

3 0 stored in a computer readable memory 22 and executed by a 

processor 24. Web browser 20 comprises any suitable HTTP 
client and is operable to generate one or more HTTP 
requests 26. Browser 20 communicates with cache server 
16 over network 14, and transmits and receives data to 
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and from server 16. HTTP request 26 comprises a URI 
portion 28 and a header portion 30. 

In the disclosed embodiment, URI portion 2 8 
comprises a URL identifying the content requested by 
browser 20. For example, URI portion 2 8 may comprise 
"http://www.domain.com/index.html". In the disclosed 

embodiment, header portion 3 0 comprises header 
information included in an HTTP request. For example, 
header portion 3 0 may comprise "content type 
application/x-zip" or "language = en-us" . 

Typically, users of a web browser, such as browser 
20, identify a desired web page by the URL associated 
with the web page. In addition, a link to a second web 
page from a first web page specifies the second web page 
by the URL associated with the second web page. However, 
when the URL seen by the user is communicated to the 
server hosting the desired web page, the URL seen by the 
user is not the only information communicated. More 
specifically, the URL seen by the user forms a portion of 
the full http request sent to the web server hosting the 
desired web page . 

The full http request includes a header, such as 
header portion 30, and the URL seen by the user, such as 
URI portion 28. The header includes various information 
not included in the URL which is useful and sometimes 
necessary for proper handling of the http request and 
proper presentation to the user of the content referred 
to by the URL in the http request. The information 
included in the header may include the identity 
(formally, the "User-Agent") of the browser being used, 
such as Internet Explorer version 4.2 or, more formally, 
"CERN-LineMode/2 . 15 libwww/2 . 17b3 " . The header may also 
indicate the language (formally " Content -Language " ) that 
the user of the browser prefers, such as English or 
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Japanese. The header may also indicate various other 
fields such as cache -control , connection, date, pragma, 
trailer, transfer-encoding, upgrade, via, warning, 

allow, content -encoding, content -length, content- 
5 location, content -md5, content -range , content -type , 
expires, last-modified, extension-header, accept, accept- 
charset, accept -encoding , accept - language , authorization, 
expect, from, host, if-match, if -modif ied-since, if-none- 
match, if-range, if -unmodif ied-since, max-f orwards, 

10 proxy-authorization, range, referer and other header 
information. For more detailed information regarding 
http requests and http header information, see Request 
For Comments 2 616 available at 

<http://www.w3.org/Protocols/> which is incorporated 

15 herein by reference. Often, header information is based 
on the choices made by a user of a web browser in the 
"Preferences" or "Options" portion of the web browser. 

Many web sites use the header information to select 
the content to present to the user in response to the 

2 0 http request. Note that for increased clarity, the 
following example may deviate from the formal 
requirements of an http request, however, properly 
formatted http requests are included within the scope of 
the invention. For example, 

2 5 "http://www.server.com/index.html" is a URL requesting 

the "index.html" content at "www.server.com". The URL is 
part of a first http request which may also indicate 
"Content -Language = Japanese" in the header portion to 
indicate that the user of the web browser which generated 

3 0 the http request prefers content to be presented in the 

Japanese language. When the web server at 

"www.server.com" receives the above first http request, 
the web server may examine both the header portion and 
the URL portion in determining which content to present . 
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More specifically, multiple "index.html" web pages may be 
maintained at the web server for different languages. 
Thus, in response to the first http request, the web 
server provides the Japanese language version of 
5 "index.html" based on the information in the header 
portion. Continuing the above example, a second http 
request may have the same URL 

("http://www.server.com/index.html"), but "Content- 

Language = English" . In response to the second http 

10 request, the web server may provide an "index.html" page 
in the English language. This example illustrates how a 
single URL ("http://www.server.com/index.html") can be 
interpreted at a web server to return distinct web pages 
based on information in the header portion of the http 

15 request. 

One problem experienced by existing caching systems 
is the inability to cache content that changes based on 
header information. This problem exists because caching 
systems typically identify cached content using only the 

2 0 URL associated with the content, not the header 

information. When "http://www.server.com/index.html" 

actually identifies multiple web pages, the cache system 
is unable to determine which of the multiple web pages to 
return because the header information is not included in 
25 the URL. The teachings provided herein involve a method 
and system for uniform resource locator transformation 
which would allow a cache system to cache header- 
dependent content, such as the 
"http://www.server.com/index.html" content described 

3 0 above . 

Network 14 comprises a data communication system 
operable to communicate data between client 12, cache 
server 16 and origin server 18. Network 14 is shown in 
FIGURE 1 in two parts only to indicate a logical 
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distinction between the communications path following 
between client 12 and cache server 16, and cache server 
16 and origin server 18. However, network 14 may 
comprise a single network. For example, network 14 may 
5 be the Internet, an asynchronous transfer mode ("ATM") 
network, an Ethernet network, a Transmission Control 
Protocol/Internet Protocol (TCP/IP) network, an intranet 
or any other suitable computer networking technologies. 
For purposes of teaching the present invention, an 

10 exemplary embodiment will be described where network 14 
comprises the Internet. 

Cache server 16 comprises any suitable combination 
of hardware and/or software operable to cache static 
and/or dynamic content items. Cache server 16 further 

15 comprises transform criteria 40. Transform criteria 40 
comprises rules and directives for transforming URLs 
received from clients 12 based on web browser 20, the 
destination origin server 18, and other information. For 
example, transform criteria 4 0 may comprise a table 

2 0 indicating the transformations to be applied. Cache 
server 16 is further operable to provide web server 
capabilities similar to those of origin server 18 for 
handling requests 26. 

Origin server 18 communicates with cache server 16 

25 over network 14. Origin server 18 comprises any suitable 
hardware and/or software executing on a computer for 
receiving and responding to requests 26. Origin server 
18 may comprise a single computer executing software or 
may comprise a plurality of computers each executing 

30 software. In the disclosed embodiment, origin server 18 
comprises an HTTP server which may also be known as a web 
server. Origin server 18 may additionally support other 
protocols such as the file transfer protocol (FTP) . 
Origin server 18 retrieves information from one or more 
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data sources 19 in response to requests 26. Origin 
server 18 is operable to retrieve static content, such as 
prewritten text files, images, and web pages, from data 
sources 19 in response to requests 26. Origin server 18 
5 is also operable to generate new, dynamic content, for 
example, by dynamically creating web pages based on 
content stored in the data sources 19 in response to 
requests 26. For example, origin server 18 may generate a 
new web page using a common gateway interface (CGI) 

10 script, generate a new web page from the result of a 
structured query language (SQL) request and perform other 
suitable content generation functions. Origin server 18 
may also be operable to generate executable software, 
such as applications and applets, in response to requests 

15 for data. For example, origin server 18 may generate a 
Java applet in response to an appropriate request 26. 

In operation, browser 20 generates request 26 for 
content from origin server 18. Cache server 16 

intercepts request 26 from web browser 20 to origin 

20 server 18. For example, cache server 16 may intercept 
request 2 0 by having the domain name service (DNS) server 
direct request 26 for the Internet domain associated with 
origin server 18 to cache server 16. Stated another way, 
requests 2 6 addressed to origin server 18 may be routed 

25 to cache server 16 through the operation of the DNS 
server . 

After receiving request 26, cache server 16 examines 
URI portion 2 8 and header portion 3 0 and compares it to 
transform criteria 40. Criteria 40 specifies which 
3 0 requests 2 6 to perform transformation on and how to 
transform selected requests 26. Criteria 40 may identify 
requests 2 6 to be transformed based on information in 
either or both of URI portion 28 and header portion 30. 
More specifically, criteria 40 may indicate that 
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information be added to request 2 6 and/or that 
information be deleted from request 26. For example, 
criteria 4 0 may specify that if "www.domain.com" is found 
in URI portion 28 and "language = Japanese" is found in 
5 header portion 30, that request 2 6 be transformed to 
include the language information found in header portion 
3 0 in URI portion 2 8 so that cache server 16 can cache 
the content associated with request 26. For another 
example, criteria 40 may specify that if 

10 "www. domain. com/acct . cgi?user_id=Mike" is found in URI 
portion 28, that request 26 be transformed to remove the 
user_id information from URI portion 28. By removing 
information, user dependent data, such as user 
identifiers, may be removed from URI portion 28 and 

15 allowing more efficient caching of web pages. More 
specifically, by removing the user dependent identifier 
information in the above example, only one copy of the 
page associated with 

" www. domain. com/acct . cgi?user_id=Mike" need be cached 

20 because once the user identifier information is removed, 

only "www.domain.com/acct.cgi" may remain which would 
indicate a single web page instead of multiple web pages. 
In general, any of the elements of header portion 30, 
such as cache - control , connection, date, pragma, 

25 trailer, transfer-encoding, upgrade, via, warning, 

allow, content-encoding, content -length, content- 
location, content -md5, content - range , content -type, 
expires, last-modified, extension-header, accept, accept- 
charset, accept-encoding, accept -language, authorization, 

30 expect, from, host, if-match, if -modified- since , if-none- 
match, if-range, if -unmodif ied-since, max- forwards , 
proxy-authorization, range, referer [sic] and other 
header fields, may be associated with any portion, 
substring or other part of URI portion 2 8 to indicate how 
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to transform requests 26. Further details of criteria 40 
are discussed in association with FIGURE 2. Typically, 
criteria 40 for a particular origin server 18 will be 
configured using information from an administrator 
5 associated with the origin server 18 as the administrator 
will know which particular web pages and other content 
are identified by a single URL, but depend on header 
information . 

After request 26 has been transformed according to 

10 transform criteria 40, cache server 16 examines URI 
portion 2 8 to determine whether the requested content has 
been cached at cache server 16. Since URI portion 28 has 
been transformed, any header information needed to 
identify the particular content to be retrieved is now 

15 available in URI portion 28. Cache server 16 can then 
uniquely identify content that is header dependent using 
only the URI portion 28 to identify the content. For 
example, a initial URI portion 2 8 of 

"http://www.server.com/index.html" and a header portion 

2 0 3 0 of "Language = Japanese" may have been transformed 
into a new URI portion 2 8 of 

"http : //www. server . com/ index . html ?Language= Japanese " to 
identify the Japanese language version of the 
"http://www.server.com/index.html" web page. Continuing 

2 5 the above example, a further request 2 6 with an initial 
URI portion 2 8 of "http://www.server.com/index.html" and 
a header portion 3 0 of "Language = English" may have been 



transformed 


into a new URI 


portion 28 


of 


"http : //www 


. server . com/ index. html?Language=English" 


to 


identify 


the English language 


version of 


the 


"http : / /www 


. server . com/ index. html " 


web page . 


This 


example 


illustrates how 


a single 


URL 


"http : //www 


. server . com/ index . html " 


which refers 


to 


multiple, 


language dependent web pages, may 


be 
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transformed to distinctly identify the multiple web pages 
for caching purposes. 

If the requested content has been cached at cache 
server 16, cache server 16 responds to request 2 6 with 
5 the cached copy of the requested content. If the 
requested content is not cached at cache server 16, cache 
server 16 communicates the untransf ormed version of 
request 26 to origin server 18. Cache server 16 next 
receives the requested information from origin server 18. 
10 Cache server 16 then transforms request 2 6 so as to be 
able to cache the requested content at cache server 16 
and to do so transparently to origin server 18. Cache 
server 16 then communicates the requested content to web 
browser 20. 

15 FIGURE 2 is a table illustrating an exemplary table 

for transform criteria 40, however, any other suitable 
implementation for criteria 4 0 may also be used. 
Transform criteria 40 may comprise match criteria 100 and 
an associated transform 102. Match criteria 100 

2 0 comprises the information in URI portion 2 8 and header 
portion 3 0 that cache server 16 uses to determine which 
transform criteria 40 to apply to request 26. Match 
criteria 100 may comprise a URL such as "www.domain.com", 
which would match that specific machine and domain 

2 5 regardless of the content requested from that domain. 

Match criteria 100 may also comprise a partial domain 
name such as ".domain.com" which would match any machine 
name within domain.com. Match criteria 100 may also 
utilize wild cards such as "*" and "?". More 

3 0 specifically, the "*" wildcard may indicate any number of 

any character while the "?" wildcard may indicate a 
single occurrence of any character, other wildcards may 
also be used. For example, " . *domain . com" would match 
any machine in any domain ending with "domain.com" where 
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* represents any number of characters preceding 
domain.com, thus, "www.mydomain.com" would be matched. 
For another example, ".domain.*" would match any machine 
in any generic top level domain (gTLD) with a domain of 
5 "domain." For further example, ".domain*.*" would match 
any machine name in a domain which started with "domain" 
and ended with any number of characters in any gTLD, 
thus, "www.domainofmine.com" would be matched, but not 
"www.mydomain.com". For yet another example, 

10 ".domain?.com" would match any machine name in any domain 
starting with the word "domain" and having a single 
additional character after the word "domain", thus, 
"www.domainl.com" would be matched, but not 
"www.domain2 2.com". Match criteria 100 may also use wild 

15 cards in the content portion of the URL. For example, 
". domain. com/ data/car* . html " would match any machine name 
in "domain.com" which requested HTML content items in the 
data directory which started with the word "car" and 
followed by any number of characters, thus, 

20 "www.domain.com/data/car_porche.html" would be matched. 

In general, any suitable wildcard scheme, such as regular 
expression matching, may be used with match criteria 100. 

Criteria 40 may also be used to indicate 
transformations based on the referring web page. Thus, 

2 5 differing content may be provided to users based on how 

the user arrived at origin server 18. When a referring 
page includes referral information in URI portion 2 8 
and/or in header portion 30, criteria 4 0 may indicate a 
transformation such that differing content is provided 

3 0 based on the referring page. For example, a request 2 6 

with URI portion of "http://www.bookstore.com/index.html" 
and a header portion 30 including "referer 
http://www.preferredsite.com" may be transformed to 
"http : / /www. bookstore . com/pref erredcustomer_index . html " . 
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Request 26 may be transformed such that users who arrive 
at "www.bookstore.com" from "www.preferredsite.com" 
receive preferential treatment because of, for example, a 
partnership between "www.bookstore.com" and 

5 "www.preferredsite.com". In addition, since the 

"referer" header field may be set regardless of the 
contents of URI portion 28, web sites can indicate 
actions to take based on the referring site regardless of 
the referring site's assistance. In contrast to existing 

10 referral programs which typically require the referring 
site to include identifying information in the URL, by 
using information in the referring field of header 
portion 30, the referring site can be identified without 
participation on the part of the referring site. 

15 Further, if a web site wishes to exclude visitors from a 
particular referring site, the transformation specified 
in criteria 4 0 for a particular URI portion 2 8 and a 
particular referring site in header portion 3 0 may direct 
the user to an access denied page so that users from that 

2 0 particular referring site may be excluded. 

Transform 102 comprises an indication of what action 
to take for the particular match criteria 100. Each 
transform 102 is associated with one or more match 
criteria 100. Transform 102 may also indicate multiple 
25 actions to be taken. For example, transform 102 may 
indicate that data in the "language field" be appended to 
the end of the URI portion 28. For another example, the 
"browser type" information in header 30 may be inserted 
in URI portion 28. Transform 102 may also indicate 

3 0 information to be removed from the matched URL. For 

example, transform 102 may indicate that a "user_id" 
element of URI portion 2 8 be removed. 

FIGURE 3 is a flow chart illustrating a method for 
transforming requests 2 6 at cache server 16. The method 
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begins at step 200 where request 26 is received at cache 
server 16. Next, at step 202, cache server 16 examines 
request 26 against match criteria 100 in transform 
criteria 40. 

5 Then, at decisional step 2 04, cache server 16 

determines whether request 2 6 should be changed. More 
specifically, if URI portion 28 and/or header portion 30 
of request 2 6 matches any one or more of match criteria 
100, cache server 16 will apply transforms 102 associated 

10 with the matched element of match criteria 100. For 
example, a request 2 6 with a URI portion 2 8 including 
"server.domain.com" may match the second exemplary entry 
of match criteria 100 (FIGURE 2) . Once a match has been 
found between one or more entries of match criteria 100 

15 and request 28 of request 26 then server 16 will 
transform request 26. More specifically, if a match is 
found between request 26 and match criteria 100 then the 
YES branch of decisional step 204 leads to step 206. At 
step 206, cache server 16 transforms URI portion 28 of 

2 0 request 2 6 according to each match criteria 100 matched 

in step 204 according to the transform 102 associated 
with the matched elements of match criteria 100. If no 
matches are found between URI portion 2 8 and match 
criteria 100 in step 204 then the NO branch of decisional 
25 step 204 leads to decisional step 208. 

At decisional step 208, URI portion 28, which may 
have been transformed in step 206, is used to determine 
whether the content requested by request 2 6 is cached at 
cache server 16. If the requested content is not cached 

3 0 at cache server 16 then the NO branch of decisional step 

208 leads to step 210. At step 210, cache server 16 
forwards the untransf ormed version of request 2 6 to 
origin server 18 and retrieves the requested content. 
Next, at step 212, the requested content item is cached 
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at cache server 16, if the requested content item is 
cacheable. As some content items are uncacheable or have 
been specified as uncacheable, the content item may not 
be cached. At step 214 the requested content item is 
5 returned to client 12. If the requested item is cached 
at cache server 16 then the YES branch of decisional step 
208 is followed to step 214. 

System 10 provides the capability to transform URLs 
so that header -dependent content may be cached by at a 

10 cache server. Existing web content caching systems which 
rely on the URL associated with the web content have 
previously lacked the capability to cache content which 
varied based on header information in the http request 
associated with the URL. System 10 provides the 

15 capability to transform the URL associated with header- 
dependent content such that caching systems can uniquely 
identify the header- dependent content using the 
associated URL. 

In addition, system 10 may be used to provide 

20 additional functionality to cache servers and other 
systems. For example, the ability to include the URL of 
a referring site in a request ' s URL without the 
assistance of the referring site provides the capability 
to site administrators to exclude visitors from 

25 particular sites. 

Other changes, substitutions and alterations are 
also possible without departing from the spirit and scope 
of the present invention, as defined by the following 
claims . 
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WHAT IS CLAIMED IS: 

1. A method for communicating data comprising: 
receiving a first request at a cache server, the 

first request having an associated first content item and 
5 having an associated header portion; 

comparing the first content item and the header 
portion to predefined criteria; 

generating a second request based on the criteria, 
the header portion and the first content item, the second 
10 request being associated with a second content item; and 

retrieving the second content item based on the 
second request. 

2. The method for communicating data according to 
15 Claim 1, wherein the header portion comprises a hypertext 

transport protocol header portion and wherein comparing 
the request comprises: 

examining a hypertext transport protocol identifier 
portion associated with the first content item; 
2 0 comparing the hypertext transport protocol 

identifier portion to the criteria; 

examining the hypertext transport protocol header 
portion associated with the first request; and 

comparing the hypertext transport protocol header 
25 portion to the criteria. 

3 . The method for communicating data according to 
Claim 2, wherein the predefined criteria comprises a 
match criteria and an associated transform. 

30 
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4. The method for communicating data according to 
Claim 3, wherein the transform comprises at least one 
rule indicating how to modify the hypertext transport 
protocol identifier portion associated with the first 

5 request to generate the second request. 

5. The method for communicating data according to 
Claim 3, wherein the transform comprises at least one 
rule indicating an element associated with the hypertext 

10 transport protocol header portion of the first request to 
be associated with the hypertext transport protocol 
identifier portion of the second request. 

6. The method for communicating data according to 
15 Claim 3, wherein the match criteria comprises at least 

one entry, each entry comprising a portion of a hypertext 
transport protocol identifier and comparing the hypertext 
transport protocol identifier portion to the criteria 
comprises comparing each entry to the hypertext transport 
20 protocol identifier portion of the first request. 

7. The method for communicating data according to 
Claim 1, wherein retrieving the second content item 
comprises : 

2 5 retrieving the second content item based on the 

second request from the cache server when the second 
content item is available from the cache server; and 

retrieving the first content item based on the first 
request from the origin server when the second content 

3 0 item is unavailable from the cache server. 

8 . The method for communicating data according to 
Claim 7, wherein the second content item is related to 
the first content item. 
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9. The method for communicating data according to 
Claim 7, wherein the second content item comprises a 
version of the first content item customized in response 
to data in the header portion associated with the first 

5 request . 

10. The method for communicating data according to 
Claim 1, wherein generating the second request comprises: 

adding a hypertext transport protocol identifier 
10 portion of the first request to a hypertext transport 
protocol identifier portion of the second request; and 

associating an element associated with the header 
portion associated with the first request with the 
hypertext transport protocol identifier portion of the 
15 second request. 
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11. A system for communicating data comprising: 
a computer readable memory; 

an application stored in the computer readable 
memory and operable to: 
5 receive a first request at a cache server, the 

first request having an associated first content item and 
having an associated header portion; 

compare the first content item and the header 
portion to predefined criteria; 
10 generate a second request based on the 

criteria, the header portion and the first content item, 
the second request being associated with a second content 
item; and 

retrieve the second content item based on the 
15 second request. 

12 . The system for communicating data according to 
Claim 11, wherein the header portion comprises 'a 
hypertext transport protocol header portion and wherein 
20 the application is further operable to: 

examine a hypertext transport protocol identifier 
portion associated with the first request; 

compare the hypertext transport protocol identifier 
portion to the criteria; 
2 5 examine the hypertext transport protocol header 

portion associated with the first request; and 

compare the hypertext transport protocol header 
portion to the criteria. 

30 13. The system for communicating data according to 

Claim 12, wherein the predefined criteria comprises a 
match criteria and an associated transform. 
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14 . The system for communicating data according to 
Claim 13, wherein the transform comprises at least one 
rule indicating how to modify the hypertext transport 
protocol identifier portion associated with the first 
5 request to generate the second request . 

15. The system for communicating data according to 
Claim 13, wherein the transform comprises at least one 
rule indicating an element associated with the hypertext 

10 transport protocol header portion of the first request to 
be associated with the hypertext transport protocol 
identifier portion of the second request. 

16. The system for communicating data according to 
15 Claim 13, wherein the match criteria comprises at least 

one entry, each entry comprising a portion of a hypertext 
transport protocol identifier and comparing the hypertext 
transport protocol identifier portion to the criteria 
comprises comparing each entry to the hypertext transport 
2 0 protocol identifier portion of the first request. 

17. The system for communicating data according to 
Claim 11, wherein the application is further operable to: 

retrieve the second content item based on the second 
2 5 request from the cache server when the second content 
item is available from the cache server; and 

retrieve the first content item based on the first 
request from the origin server when the second content 
item is unavailable from the cache server. 

30 

18 . The system for communicating data according to 
Claim 17, wherein the second content item is related to 
the first content item. 
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19. The system for communicating data according to 
Claim 17, wherein the second content item comprises a 
version of the first content item customized in response 
to data in the header portion associated with the first 

5 request . 

20. The system for communicating data according to 
Claim 11, wherein the application is further operable to: 

add a hypertext transport protocol identifier 
10 portion of the first request to a hypertext transport 
protocol identifier portion of the second request; and 

associate an element associated with the header 
portion associated with the first request with the 
hypertext transport protocol identifier portion of the 
15 second request . 
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METHOD AND SYSTEM FOR UNIFORM RESOURCE 
LOCATOR TRANSFORMATION 

ABSTRACT OF THE DISCLOSURE 
5 A web browser (2 0) at a client (12) generates a 

request (26) for content from an origin server (18) . A 
cache server (16) intercepts the request and examines the 
request to determine whether the request should be 
transformed. More specifically, the cache server 

10 compares transform criteria (40) to a uniform resource 
identifier portion (28) and a header portion (30) 
associated with the request. By transforming the request 
such that information in the header portion is included 
in the uniform resource identifier portion, the cache 

15 server can cache the content associated with the request 
using the uniform resource identifier portion of the 
request . 
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